Association Rule Generation and Evaluation of Interestingness Measures for Artwork Tags
نویسنده
چکیده
Finding associations between groups of items within a data set is referred to as the problem of association rule mining. This can be decomposed into the two subproblems frequent itemset mining and association rule generation. The open source data mining framework ELKI used throughout this work for experiments so far only supports the first subproblem. Therefore the implementation of association rule generation and its integration in ELKI is part of this work. The ARTigo project is an online platform with the purpose of powering a semantic artwork search engine. By playing special games on the projects website users contribute descriptive tags to artworks. These collected keywords can be analyzed using various data mining techniques. This work concentrates on applying association rules mining to the ATRigo dataset, using various interestingness measures. The main part of this thesis is to evaluate appropriate interestingness measures for the given ARTigo data. Therefore, ten interstingness measures are first searched for similarities. Second, their ability of eliminating uninteresting and locating interesting association rules is compared. Last their one thousand highest ranked rules are analyzed by the means of five proposed properties.
منابع مشابه
Numeric Multi-Objective Rule Mining Using Simulated Annealing Algorithm
Abstract as a single objective one. Measures like support, confidence and other interestingness criteria which are used for evaluating a rule, can be thought of as different objectives of association rule mining problem. Support count is the number of records, which satisfies all the conditions that exist in the rule. This objective represents the accuracy of the rules extracted from the da...
متن کاملModeling of the counter-examples and association rules interestingness measures behavior
Association rules discovery is one of the most important tasks in Knowledge Discovery in Data Bases. Since the initial APRIORI algorithm, many efforts have been done in order to develop efficient algorithms. It is well known that APRIORI-like algorithms within the (unsatisfying) support/confidence framework may produce huge amounts of rules and thus one of the most important steps in associatio...
متن کاملSemantic interestingness measures for discovering association rules in the skeletal dysplasia domain
BACKGROUND Lately, ontologies have become a fundamental building block in the process of formalising and storing complex biomedical information. With the currently existing wealth of formalised knowledge, the ability to discover implicit relationships between different ontological concepts becomes particularly important. One of the most widely used methods to achieve this is association rule mi...
متن کاملCombining Clustering techniques and FCA to characterize Interestingness Measures
Formal Concept Analysis "FCA" is a data analysis method which enables to discover hidden knowledge existing in data. A kind of hidden knowledge extracted from data is association rules. Di erent Interestingness Measures "IMs" were reported in the literature to extract only relevant association rules. Given a dataset, the choice of a good interestingness measure remains a challenging task for a ...
متن کاملAssociation rule mining with a correlation-based interestingness measure for video semantic concept detection
Content-based multimedia retrieval and automatic semantic concept detection research areas have been motivated by the high demands of multimedia applications and services. Due to its high efficiency and good performance, association rule mining (ARM) has been adopted to discover the association patterns from the multimedia data and predict the target concept classes in various media types. As a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016